The Lexicon-Grammar of Italian Idioms
نویسنده
چکیده
This paper presents the Lexicon-Grammar classification of Italian idioms that has been constructed on formal principles and, as such, can be exploited in information extraction. Among MWEs, idioms are those fixed constructions which are hard to automatically detect, given their syntactic flexibility and lexical variation. The syntactic properties of idioms have been formally represented and coded in binary matrixes according to the Lexicon-Grammar framework. The research takes into account idioms with ordinary verbs as well as support verb idiomatic constructions. The overall classification counts 7,000+ Italian idioms. In particular, two binary matrixes of two classes of idioms will be presented. The class C1 refers to the Verb + Object constructions, whereas the class EPC refers to the prepositional constructions with the support verb essere. Pre-constructed lexical resources facilitate idioms retrieval both in the case of "hybrid" and "knowledge-based" approaches to Natural Language Processing.
منابع مشابه
Words and Idioms
1. Idioms are constructions 1.1 Idioms in the Chomskyan paradigm Croft and Cruse (2004:225) describe the role of idioms in construction grammar as follows: “It is not an exaggeration to say that construction grammar grew out of a concern to find a place for idiomatic expressions in the speaker’s knowledge of a grammar of their language”. The reason for this focus on idioms in early construction...
متن کاملA Generative Grammar Approach For The Morphologic And Morphosyntactic Analysis Of Italian
A morphologic and morphosyntactic analyzer for the Italian language has been implemented in VM/Prolog 131 at the IBM Romc Scientific Center as part of a project on text understanding. Aim of this project is the development of a prototype which analyzes short narrative texts (press agency news) and gives a formal representation of their "meaning" as a set of first order logic expressions. Questi...
متن کاملImplementing European Portuguese Verbal Idioms in a Natural Language Processing System
This paper is based on an extant lexicon-grammar of European Portuguese verbal idioms (e.g., deitar mãos à obra, literally, ̳to throw hands to the work‘, ̳to start working‘.). This a database containing about 2,400 expressions, along with all relevant information on the sentence structure, distributional constraints and transformational properties of these frozen sentences. In this paper, we pr...
متن کاملThe Generation of Idiomatic and Collocational Expressions
Collocations whose semantic content is not or only partially composed from the semantic content of their parts are often viewed as problematic for generation. In this paper a tactical generator combining FUF as the generation engine and HPSG as the grammar framework is presented. It is shown, that the lexicon driven approach to syntactic and semantic processing is well-suited for the generation...
متن کاملA Cognitive Model of Sentence Interpretation: the Construction Grammar approach
This paper describes a new, psychologically-plausible model of human sentence interpretation, based on a new model of linguistic structure, Construction Grammar. This on-line, parallel, probabilistic interpreter accounts for a wide variety of psycholinguistic results on lexical access, idiom processing, parsing preferences, and studies of gap-filling and other valence ambiguities, including var...
متن کامل